A SAS/IML Macro for Goodness-of-Fit Testing in Logistic Regression Models with Sparse Data
نویسنده
چکیده
The logistic regression model has become the standard analyzing tool for binary responses in a variety of disciplines. Methods for assessing goodness-of-fit, however, are less developed and this is especially pronounced in calculating goodness-of-fit tests with sparse data, when the standard tests (deviance and Pearson test) behave unsatisfactorily. In our paper we show two solutions to the problem that are implemented in the LOGISTIC procedure in SAS software, and introduce five additional testing procedures from the statistical literature. By means of a simulation study we show that these additional tests are valid instruments for assessing goodness-offit in logistic regression models, even with sparse data. Finally, we present the SAS/IML macro %GOFLOGIT which allows calculation of the introduced tests and illustrate the macro with an example from occupational epidemiology on hand eczema in hairdressers.
منابع مشابه
Robust Standard Error Estimate for Cluster Sampling Data: A SAS/IML Macro Procedure for Logistic Regression with Huberization
Data sets with cluster structure are very common in practical business and research, one has to take into account the intra-cluster correlation in data analysis. This paper systematically discusses the Huber/White standard error estimate for cluster sampling data in logistic regression, and presents a user-friendly SAS/IML macro procedure which can automatically fit logistic model, calculate ro...
متن کاملSAS Macro for Generalized Method of Moments Estimation for Longitudinal Data with Time-Dependent Covariates
Longitudinal data with time-dependent covariates is not readily analyzed as there are inherent, complex correlations due to the repeated measurements on the sampling unit and the feedback process between the covariates in one time period and the response in another. A generalized method of moments (GMM) logistic regression model (Lalonde, Wilson, and Yin 2014) is one method to analyze such corr...
متن کاملA Macro for Computing a Goodness of Fit Statistic for Linear Mixed Models
In the SAS System, PROC MIXED offers users the possibility to perform analysis of complex data where assumptions of traditional analysis of variance (ANOVA) methods such as homogeneity of variance or independence of error terms might be violated. Thus, this procedure can be used to analyze data where the observations are assumed to come from a normal distribution but are correlated such as in l...
متن کاملEstimate-based goodness-of-fit test for large sparse multinomial distributions
The Pearson’s chi-squared statistic (X2) does not in general follow a chi-square distribution when it is used for goodness-of-fit testing for a multinomial distribution based on sparse contingency table data. We explore properties of Zelterman’s (1987) D2 statistic and compare them with those of X2 and we also compare these two statistics and the statistic (Lr) which is proposed by Maydeu-Oliva...
متن کاملComparison of ordinary logistic regression and robust logistic regression models in modeling of pre-diabetes risk factors
Background: Regarding the increased risk of developing type 2 diabetes in pre-diabetic people, identifying pre-diabetes and determining of its risk factors seems so necessary. In this study, it is aimed to compare ordinary logistic regression and robust logistic regression models in modeling pre-diabetes risk factors. Methods: This is a cross-sectional study and conducted on 6460 people, over ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001